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Transcriptomic analyses have identified tens of thousands of intergenic, intronic, and c/s-antisense long noncoding RNAs 
(IncRNAs) that are expressed from mammalian genomes. Despite progress in functional characterization, little is known 
about the post-transcriptional regulation of IncRNAs and their half-lives. Although many are easily detectable by a variety 
of techniques, it has been assumed that IncRNAs are generally unstable, but this has not been examined genome-wide. 
Utilizing a custom noncoding RNA array, we determined the half-lives of -800 IncRNAs and -12,000 mRNAs in the 
mouse Neuro-2a cell line. We find only a minority of IncRNAs are unstable. LncRNA half-lives vary over a wide range, 
comparable to, although on average less than, that of mRNAs, suggestive of complex metabolism and widespread func- 
tionality. Combining half-lives with comprehensive IncRNA annotations identified hundreds of unstable (half-life < 2 h) 
intergenic, c/5-antisense, and intronic IncRNAs, as well as IncRNAs showing extreme stability [half-life > 16 h). Analysis of 
IncRNA features revealed that intergenic and c/5-antisense RNAs are more stable than those derived from introns, as are 
spliced IncRNAs compared to unspliced [single exon) transcripts. Subcellular localization of IncRNAs indicated wide- 
spread trafficking to different cellular locations, with nuclear-localized IncRNAs more likely to be unstable. Surprisingly, 
one of the least stable IncRNAs is the well-characterized paraspeckle RNA Neatl, suggesting Neatl instability contributes to 
the dynamic nature of this subnuclear domain. We have created an online interactive resource (http:// stability. 
matticklab.com) that allows easy navigation of IncRNA and mRNA stability profiles and provides a comprehensive an- 
notation of -7200 mouse IncRNAs. 



[Supplemental material is available for this article.] 

High-throughput studies of mammalian genomes have revealed 
that most of the genome is transcribed in a complex manner, in- 
cluding the production of tens of thousands of long non-protein- 
coding RNAs (IncRNAs) (Carninci et al. 2005; Birney et al. 2007; 
Guttman et al. 2009, 2010). Despite rapid progress, only a small 
proportion of IncRNAs have been functionally characterized 
(Amaral et al. 2011), and little is known about most IncRNAs. One 
particularly poorly understood aspect of IncRNAs is their post- 
transcriptional regulation and metabolism in the cell. Despite an 
absence of genome-wide studies to examine IncRNA stability it has 
been a general expectation that IncRNAs will be less stable than 
protein-coding mRNAs, both due to their lower average level of 
expression and the existence of known unstable classes of IncRNAs 
(Dinger et al. 2009a). These classes, including cryptic unstable 
transcripts (CUTs) in yeast (Wyers et al. 2005), upstream non- 
coding transcripts (UNTs) in Arabidopsis (Chekanova et al. 2007), 
and promoter upstream transcripts (PROMPTs) in humans (Preker 
et al. 2008), have supported the suggestion that many, if not most, 
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IncRNA transcripts are highly unstable (Houseley and Tollervey 
2009). However, the stability of the small number of functional 
IncRNAs that have been determined (Sheardown et al. 1997; Seidl 
et al. 2006; Sone et al. 2007; Askarian-Amiri et al. 2011), suggest 
that IncRNAs, like mRNAs, have a wide diversity of half-lives. 
However, genome-wide methods, which provide a more objective 
view of IncRNAs as a class, are required to test this hypothesis. 

Previous studies have investigated mRNA stability genome- 
wide using microarray technology and RNA sequencing (Raghavan 
et al. 2002; Yang et al. 2003; Narsai et al. 2007; Friedel et al. 2009; 
Sharova et al. 2009; Rabani et al. 2011; Schwanhausser et al. 2011) 
but have not considered IncRNAs. These studies revealed a wide 
variation in stability between different mRNAs and its importance 
in combining with transcription to determine the steady-state 
levels of transcripts and also to control the speed and timing of 
changes in gene expression (Elkon et al. 2010; Rabani et al. 2011). 
For example, the low stability of groups of mRNAs, including those 
that encode transcription factors, is integral to their rapid dynamic 
regulation (Yang et al. 2003; Narsai et al. 2007; Friedel et al. 2009). 
Although changes in transcription are responsible for most dynamic 
changes in gene expression, changes in RNA stability also provide 
a mechanism to regulate gene expression levels (Blattner et al. 2000; 
Sharova et al. 2009; Rabani et al. 2011). 

Here, we perform a genome-wide analysis of IncRNA stability 
using custom microarrays that examine —7200 IncRNAs together 
with —20,000 protein-coding transcripts. Although the average 
turnover of IncRNAs is higher than mRNAs, we find that IncRNAs 
display a wide range of stabilities comparable to that of protein- 
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coding transcripts. The functionally characterized IncRNA Neatl 
was also found to be highly unstable, highlighting the notion that 
high turnover is not refractory to function and rather can promote 
the dynamic nature of IncRNAs and the cellular constituents with 
which they are associated. 

Results 

Genome-wide determination of RNA half-lives 

To examine the stability of IncRNAs, we inhibited transcription in 
mouse Neuro-2a (N2A) neuroblastoma cells with actinomycin D 
(Hurwitz et al. 1962) and measured RNA levels over a 32-h time 
course. In the absence of transcription, the percentage of an RNA 
remaining is determined by its rate of decay. Use of an extended 
time course, similar to that conducted for mRNAs in Arabidopsis 
(Narsai et al. 2007), allows improved half-life measurements for 
transcripts with slow decay rates (Friedel et al. 2009) compared to 
previous transcriptional inhibition experiments in mammals that 
were continued to only 8 h (Raghavan et al. 2002; Yang et al. 2003; 
Sharova et al. 2009). We validated the transcriptional blocking 
methodology by determining the stability of the Myc oncogene 
mRNA, which is known to have a very short half-life (Dani et al. 
1984; Raghavan et al. 2002), by qPCR. Consistent with previous 
observations, Myc mRNA was highly unstable (Fig. 1 A) with a half- 
life of 33 min (95% confidence interval 25-48 min). 

Genome-wide analysis of IncRNA stability was conducted 
using NCode (Life Technologies) microarrays, which contain 
probes that target —7200 distinct IncRNA transcripts as well as 
probes that target most annotated protein-coding genes. As re- 
ported previously (Sharova et al. 2009), the most stable transcripts 
increased in relative abundance in samples from later time points 
and so give the artifactual appearance of up-regulation. Therefore, 



we normalized the scale of the array expression data to a pool of 
ultra-stable control transcript(s). Gapdh had been used to normal- 
ize shorter time courses in a previous study (Raghavan et al. 2002), 
but it is not stable over 32 h (Supplemental Fig. SI), and scaling to 
a single mRNA over a long time course could introduce noise. We 
recently identified the IncRNA Zfasl (1500012F01Rik) as highly 
stable with no degradation over a 16-h time course (Askarian-Amiri 
et al. 2011) and were able to confirm by qPCR that its relative ex- 
pression was virtually unchanged after 32 h (Fig. IB). As expected, 
the microarray data showed a relative increase in Zfasl expression 
by 32 h (Fig. 1C). Therefore, we selected six genes showing similar 
profiles to Zfasl that also had previous evidence of high stability in 
mouse cells (Friedel et al. 2009). This high stability was confirmed 
by qPCR analysis of two of these mRNAs, Atp5e and Gstml (Sup- 
plemental Fig. SI), which showed little or no change in abundance 
over the time course. The average fold change between 0 h and 
each treated time point for these seven genes was adjusted to zero, 
and the intensity values of the probes were scaled proportionately 
to create the normalized time course data set. 

We identified 14,987 transcripts that were expressed signifi- 
cantly above background (Supplemental Table SI), from which we 
calculated transcript half-lives by modeling each transcript with 
one-phase exponential decay or linear decay. As one-phase expo- 
nential decay is not optimal for modeling some highly stable 
transcripts, we calculated half-lives using both exponential decay 
and linear decay and selected the best model for each transcript 
(see Methods). Examination of transcript half-lives revealed that 
linear decay began to be favored for transcripts with half-lives 
above 18 h. After filtering to remove transcripts that could not be 
accurately modeled, we determined the half-lives for 12,670 
transcripts (including 823 IncRNAs), (Supplemental Table S2). 
Examples of half-lives calculated from the microarray data are 
shown in Figure 1D-F. 
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Figure 1. RNA half-life determination following transcription inhibition. (A,B) Transcript decay curves after blocking transcription in N2A with acti- 
nomycin D and measuring transcript remaining relative to a control gene by qPCR. Error bars show standard deviation. (A) Myc decay relative to Gapdh. 
Gapdh is a suitable control gene for transcripts that are not highly stable. Results are from four biological replicates, which were subsequently used for 
microarray analysis. The fitted curve was modeled by one-phase decay using nonlinear least squares regression. Myc expression was also tested in mock 
treated time courses, which did not show evidence of transcript decay. (B) Transcript decay curve for Zfasl relative to Atp5e. Results from three biological 
replicates. No degradation is observed, and nonlinear regression supports a horizontal line fit. (C) Zfasl expression over 32-h time course following 
transcription inhibition from microarray. Four biological replicates; error bars show standard deviations. Nonlinear regression was used to test model fits 
and supports a linear fit with a positive slope showing apparent up-regulation of expression by 32 h. (D-F) Decay curves and half-lives determined for two 
random mRNAs (D,f) and one IncRNA (F) transcript from microarrays. All were modeled using one-phase exponential decay. Error bars represent standard 
deviations. 
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To confirm modeled transcripts were within the dynamic 
range of the arrays, we compared transcript half-lives with their 
expression values at 0 h. We did not observe increased half-lives at 
low expression levels, which could have suggested that array 
background was preventing the decay of signal. This observation 
held true whether comparing all RNAs, or IncRNAs only (Supple- 
mental Fig. S2). Indeed, there was a significant correlation (r = 0.2045, 
Spearman correlation) between higher expression and longer half- 
lives for all RNAs, but no correlation between IncRNA expression 
and half-life (Supplemental Fig. S2). 

We validated that the arrays were sufficiently sensitive to 
detect unstable transcripts by comparing the calculated half-life for 
Myc as determined by qPCR to that determined by the microarrays, 
as well as the array stability of Znfxl, which we recently showed to 
be unstable in N2A cells by qPCR (Askarian-Amiri et al. 201 1). The 
array-determined half-life of Myc was 34 min, which is in excellent 
agreement with qPCR, as was that of Znfxl (65 min by array versus 
50 min by qPCR). These data confirmed the accuracy of the array 
results and our ability to determine the half-lives of short-lived 
transcripts. 

GO analysis of stable and unstable protein-coding genes 

Previous studies have shown that unstable protein-coding transcripts 
are enriched for transcription factors, while mRNAs encoding pro- 
teins for various cellular metabolic processes are enriched amongst 
stable transcripts (Yang et al. 2003; Friedel et al. 2009; Sharova et al. 
2009; Rabani et al. 2011). To assess the biological robustness of our 
data, we performed a GO analysis of protein-coding transcripts 
found within the unstable (half-life < 2 h) and highly stable (half -life 
> 12 h) fractions using Fatigo (Medina et al. 2010). Consistent with 
previous studies, we found that genes involved in transcription and 
regulation of gene expression were enriched in the unstable frac- 
tion, especially proteins with transcription factor activity, which 
were strongly overrepresented (P =1.59 X 10~ 7 ), while genes in- 
volved in numerous cellular metabolic processes and with oxi- 
doreductase activity were enriched amongst the highly stable 
mRNAs (Supplemental Table S3). Together, these observations 
suggest our experimental model for RNA decay is applicable for 
making biologically relevant interpretations. 

Stability of IncRNAs 

To examine the stability of IncRNAs and how they compare to 
mRNAs, we graphed the distribution of half-lives. Figure 2A reveals 
that transcript half-lives ranged from <30 min to >48 h. LncRNAs 
show a similar range of half-lives to protein-coding transcripts, 
suggesting that IncRNA stability is a regulated process. The median 
IncRNA half-life was 3.5 h (mean 4.8 h), whereas the median half- 
life for protein-coding transcripts was 5.1 h (mean 7.7 h). This re- 
sult shows that IncRNAs are not unstable as a class, although their 
half-lives are, on average, shorter than protein-coding RNAs (P < 
0.0001) (Fig. 2A). We also find that a higher percentage of IncRNAs 
are classified as unstable (29% versus 17%) (t!/ 2 < 2 h; P < 0.0001) 
(Fig. 2B,C) and a lower percentage as highly stable (6% versus 1 7%) 
(t'/ 2 > 12 h; P < 0.0001) (Fig. 2B,D). 

In total, we identified —240 unstable IncRNA transcripts (tVz < 
2 h). Although most IncRNAs are yet to be annotated, this group 
contains a number of known and functionally validated IncRNAs. 
These include Neatl (see below), the probable mouse Adapt33 
(5430416N02Rik) homolog, a number of IncRNAs that associate 
with chromatin-binding proteins (Guttman et al. 2011) (see be- 
low), small RNA host transcripts including Mirl7hg, an isoform of 




Figure 2. Half-lives of IncRNA and protein-coding transcripts. (A) Box- 
and-whisker plot of coding and IncRNA transcript half-lives. (Whiskers) 1 st- 
99th percentile, with individual transcripts outside this shown as dots. (Box) 
25th-75th percentile. Difference calculated using a non parametric Mann- 
Whitney t-test. (B) Frequency distribution showing the fraction of protein- 
coding and IncRNA transcripts in 2-h bins. Plotted points are at the center of 
the 2-h bin. Only time points with 1 % or more of transcripts are plotted. (C) 
Percentage of unstable (half-life under 2 h) IncRNA and protein-coding 
transcripts. Significant difference calculated using a x 2 test. (D) Percentage 
of highly stable (half-life over 1 2 h) IncRNA and protein-coding transcripts. 
Significant difference calculated using a x 2 test. 



Rmst, and several snoRNA host genes including Snhg3, Snhg5, and 
one Snhg8 isoform, as well as the imprinted IncRNAs Kcnqlotl and 
Pegl3. The low stability of several snoRNA host transcripts is in 
stark contrast to the extreme stability of Zfasl, which is also a 
snoRNA host gene. 

Fifty-one IncRNAs had half-lives of over 12 h and were, 
therefore, classified as highly stable. These IncRNAs corresponded 
to a variety of genomic context classifications including inter- 
genic, bidirectional with other transcripts, and intronic to protein- 
coding loci. Few have been characterized in detail with the 
exception of 4933436C20Rik (Lincl399), identified as involved in 
maintaining pluripotency in embryonic stem cells, where it asso- 
ciates with the chromatin-binding protein CBX3 (Guttman et al. 
2011). Given that we have characterized the stability of only 
a subset of the total number of IncRNAs, these results suggest that 
there are a large number of highly stable IncRNAs. Although the 
function of these transcripts is generally unknown, genes with 
enzymatic and housekeeping functions are enriched amongst 
stable protein-coding RNAs, so we hypothesize that housekeeping 
IncRNAs will also show high stability. 

Clustering of RNA decay profiles 

Transcripts of different stabilities provide a rich variety of decay 
profiles. To visualize these, we clustered all IncRNAs using un- 
supervised hierarchical clustering (Fig. 3). These results further 
reinforce the notion that IncRNAs are not generally unstable but 
rather show a wide range of stability profiles. 

Previous analyses of coding transcript stability have revealed 
relationships between stability and gene function and that mRNAs 
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Figure 3. Hierarchical clustering of transcript decay rates. Unsupervised 
hierarchical clustering of all I ncRNA transcripts above the expression cut-off. 
Clustering was performed using cluster3 (de Hoon et al. 2004) and visual- 
ized in Java Treeview (Saldanha 2004). All transcripts were set to an ex- 
pression level of 1 at 0 h, so clustering is determined only by decay rate. 
Transcripts that decay quickly turn black during the early time points; 
transcripts that show no degradation remain bright yellow. As clustering was 
performed on all transcripts above the expression cut-off, transcripts whose 
half-lives could not be determined are also included. 



that act in common biological pathways (same GO function) 
(Sharova et al. 2009) or proteins found within the same complexes 
(Friedel et al. 2009) often have similar stabilities. On this basis, we 
hypothesize that the stability of IncRNAs may be similarly related 
to their functional classes and that IncRNAs with related functions 
may tend to cluster together according to their stability profiles. 

To reveal potential relationships between stability and func- 
tion, we reclustered all expressed transcripts by their decay profiles 
using QAPgrid (Inostroza-Ponta et al. 2011). The decay profile for 
each individual cluster is shown in Figure 4. As a result of this 
optimization-based approach to visualization, the distance between 
each cluster in the final layout is related to the difference between 
the decay profiles. From our data set of 14,987 decay profiles, the 
method produced 152 clusters, which, in turn, aggregate into seven 
"super-clusters" containing related decay profiles. The 152 clusters 
contain from 2 to 4447 transcripts. Most transcripts are found in 
a relatively small number of clusters, with 29.7% present in the 
single most populous cluster (Cluster 0), which is part of the largest 
super-cluster (Super-cluster 0). Indeed, the ten most populous in- 



dividual clusters contain 79.9% of all transcripts, from the generally 
highly unstable transcripts in Cluster 1 (1087 transcripts) through 
to the extremely stable transcripts in Clusters 15 (196 transcripts) 
and 27 (797 transcripts). Hence, it appears that a reasonably small 
number of clusters and super-clusters can define the dynamics of 
RNA degradation. 

To facilitate the browsing and visualization of transcript sta- 
bilities, we developed an interactive, web-accessible version of the 
QAPgrid representation of the clusters (http: //stability. matticklab. 
com/). Each decay cluster is linked to a table of the transcripts it 
contains, together with their associated half-lives and further an- 
notations including transcript classification, structural features, 
conservation, subcellular localization and expression information. 

Comparing the annotated transcripts present in different super- 
clusters (i.e., independent of modeled half-lives) revealed a number 
of features overrepresented within individual super-clusters. For ex- 
ample, Super-cluster 1 showed an enrichment for genes encoding 
nuclear proteins (P < 5.72 x 10~ 60 ) involved in the regulation of 
transcription (P< 4.40 X 10~ 56 ) as estimated by giprofiler (Reimand 
et al. 2011), while a MEDLINE-based analysis using GATHER (Chang 
and Nevins 2006) revealed a significant enrichment of genes asso- 
ciated with the term "oncogene" (P < 0.0003) (Supplemental Table 
S4). Super-cluster 1 also had the highest proportion of IncRNAs 
(13.5%). Taken together these analyses demonstrate how QAPgrid 
clustering facilitates both the extraction of biologically meaningful 
data from, and visualization of, genome-wide data sets. 

Combining clustering and RNA half-lives to identify 
potentially independent transcripts 

The identification of transcripts from common loci with different 
half-lives or occurrence within different super-clusters provides a po- 
tential means to disentangle independent transcripts from complex 
loci. This is useful as such loci can contain numerous overlapping 
transcripts with alternative splice sites and varying start and end 
positions, which can be very challenging to distinguish. 

We searched the four most populous super-clusters (0, 1, 3, 
and 5, containing 94% of clustered probes) for Unigene genes with 
multiple probes that clustered into more than two super-clusters. 
This analysis identified 21 genes (Supplemental Fig. S3; Sup- 
plemental Table S4). One gene identified was Tcf4 (transcription 
factor four). Two probes to constitutive splice junctions showed 
half-lives of 14.5 and 6.7 h, respectively, whereas three probes 
distributed across the 5-kb Tcf4 3'UTR (Supplemental Fig. S4) all 
identified a half -life of 3.4-3.5 h, The high concordance and di- 
vergent half-life of the 3'UTR suggests its regulation is independent 
of the common coding isoform, potentially due to processing of 
the 3 'UTR (Mercer et al. 201 1) or due to specific use of the 3 'UTR by 
an alternative Tcf4 isoform. This differential stability across the 
gene locus would be difficult to detect by standard expression ar- 
rays or next-generation sequencing, but regulation of this nature is 
readily identifiable from transcript decay. 

Another example that illustrates the value of stability pro- 
filing for disentanglement of transcripts is the noncoding snoRNA 
host gene Snhg4, which terminates ~4 kb upstream of the coding 
gene Matr3. Both cDNA cloning (Bortolin and Kiss 1998) and next- 
generation sequencing (Guttman et al. 2010) suggest Snhg4 is 
commonly a 5' upstream start site for Matr3 (Supplemental Fig. S4). 
However, our data show that Snhg4 is unstable (and found in Su- 
per-cluster 1), whereas Matr3 is highly stable (and found in Super- 
cluster 5), suggesting that these transcripts can be distinct and 
independently regulated. 
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Figure 4. Distance-based clustering of transcript decay rates. Transcripts are automatically clustered; with those showing indistinguishable decay 
profiles over the time course present in the same cluster, while transcripts with similar profiles are found in nearby clusters. The physical distance between 
individual clusters and between super-clusters represents the degree of difference in the decay profile. Clusters of clusters or super-clusters are created by 
applying the method used to form the clusters to the clusters themselves. Title gives the cluster number and the number of probes in the cluster. (X-axis) 
Cumulative expression of all probes in the cluster. Decay profiles are stacked bar graphs with every stack representing a separate transcript; when there are 
many transcripts in a cluster, the expression level of some transcripts cannot be individually visualized and are seen as areas of black (representing many 
transcripts). An interactive version of this figure can be found at http://stability.matticklab.com/. 



Genomic location impacts IncRNA stability 

Previous studies have defined a number of sequence elements that 
influence mRNA stability (Yang et al. 2003; Narsai et al. 2007; 
Sharova et al. 2009). To further understand how IncRNAs differ in 
stability and why they are, on average, less stable than mRNAs, we 
investigated the relationship between IncRNA stability and their 
corresponding sequence elements and features. 

To analyze various features that could affect IncRNA stability, 
we created a comprehensive set of genome-wide IncRNA annota- 
tions (Supplemental Table S2), which represent the most detailed 
annotations of mouse IncRNAs to date. An important feature of 
IncRNAs is the genomic location they are transcribed from in re- 
lation to protein-coding genes (Mercer et al. 2009). On this basis, 
IncRNAs can be broadly classified as intronic, where they originate 
from within coding gene loci, ds-antisense, where they are tran- 
scribed from the opposite stand to other transcripts, or intergenic, 
where they are transcribed outside of, or between, coding genes 
(Supplemental Fig. S5). Comparison of the stability of intergenic, 
intronic and ris-antisense IncRNAs revealed significant differences 
between the groups, with intronic IncRNAs less stable than those 



from intergenic regions or antisense to other transcripts (P < 0.001) 
(Fig. 5A) with a much greater percentage of intronic IncRNAs 
classified as unstable (tVz < 2 h) (P < 0.0001) (Fig. 5B,C), although 
there was no difference between the number classified as highly 
stable (VA > 12 h) (P = 0.34, x 2 test). 

LncRNAs can be transcribed from other genomic loci in- 
cluding bidirectionally (either head-to-head or tail-to-tail) with 
other transcripts (Engstrom et al. 2006), 3' from coding genes 
(uaRNAs) (Mercer et al. 2011) and from promoter regions (pro- 
moter-associated) (Preker et al. 2008) (Supplemental Fig. S5). An- 
other subset of IncRNAs are large intergenic noncoding RNAs 
(lincRNAs) (Guttman et al. 2009, 2010), many of which have 
critical roles in differentiation and development (Guttman et al. 
2011). We identified the half-lives for 90 transcripts from lincRNA 
regions, including LincRNA-p21 (Gml6197), which acts downstream 
from p53 (also known as TP53) to repress many genes (Huarte et al. 
2010) and which was quite stable with a half-life of over 6 h. 

Comparing all classes of IncRNAs revealed that intergenic, ds- 
antisense, tail-to-tail bidirectional and uaRNA transcripts are sig- 
nificantly more stable (on average) than intronic and promoter 
associated IncRNAs (P < 0.01) (Fig. 5D). Intergenic and ds-anti- 
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Figure 5. Effect of IncRNA features on stability. (A) Comparison of the stability of intronic versus inter- 
genic and c/s-antisense IncRNAs. Box-and-whisker plot. (Whiskers) 1st-99th percentile, with individual 
transcripts outside this shown as dots. (Box) 25th-75th percentile. Difference calculated using one-way 
ANOVA with Kruskal-Wallis nonparametric test and Dunn's post-test to compare individual annotations. (£) 
Frequency distribution showing the fraction of IncRNA transcripts in 2-h bins. Plotted points are at the center 
of the 2-h bin. Only time points with 1 % or more of transcripts are plotted. (C) Percentage of unstable (half- 
life under 2 h) IncRNA intergenic, c/s-antisense, and intronic transcripts. Significant difference calculated 
using x 2 test. (D) Comparison of the stability of all IncRNA genomic classifications. Box-and-whisker plot. To 
focus on the center of the distribution, whiskers show 10th-90th percentile, with individual transcripts 
outside this shown as dots, and only half-lives between 0.2 h and 20 h are shown. Significant differences 
found between stability of classes using one-way ANOVA with Kruskal-Wallis nonparametric test and Dunn's 
post-test to compare individual annotations. (*) Level of significance common to all comparisons. (£) 
Comparison of the stability of localized transcripts. Box-and-whisker plot and statistical testing as per A. (F) 
Frequency distribution showing the fraction of nuclear, cytoplasmic, and nonenriched transcripts in 2-h bins. 
Plotted points as per B. (*) P < 0.05, (**) P < 0.01 , (***) P < 0.001 . 



including modification of epigenetic 
state, alternative splicing, and RNA poly- 
merase binding efficiency (Dinger et al. 
2008; Khalil et al. 2009; Zhao et al. 2010; 
Amaral et al. 2011). Proteins with similar 
regulatory roles, such as transcription 
factors, typically have short half-lives 
and are nuclear-localized. Therefore, to 
identify IncRNAs with potential regula- 
tory roles in the nucleus, we interrogated 
total RNA isolated from nuclear and cy- 
toplasmic fractionated N2A cells with the 
NCode microarrays. Successful isolation 
of pure nuclei and cytoplasmic fractions 
was confirmed by qPCR and Western blot 
(Supplemental Fig. S7). The array data 
were validated by qPCR for a number of 
genes ranging from nuclear-specific to 
cytoplasmically enriched (Supplemental 
Fig. S8) and also by the high nuclear en- 
richment of known nuclear IncRNAs, 
such as Xist and Bacelas. Array analysis 
revealed 499 IncRNAs that were signifi- 
cantly enriched in the cytoplasm and 
191 in the nucleus. Intersection with 
lincRNAs and IncRNAs listed in IncRNAdb 
revealed a number of known and unchar- 
acterized localizations of IncRNAs, includ- 
ing nuclear enrichment of Adapt33, Zfasl, 
and several other snoRNA host genes. 
Taken together, these results support the 
widespread trafficking of IncRNAs to dif- 
ferent subcellular locations. 

Among the subset of IncRNAs for 
which a half-life could be determined, we 
identified 22 that were significantly en- 
riched in the cytoplasm and 105 that were 
enriched in the nucleus. Nuclear-localized 
transcripts displayed a significantly lower 
stability (Fig. 5E) with a greater pro- 
portion (52%) of the nuclear-enriched 
transcripts classified as highly unstable 
(P < 0.0001, x 2 test) (tVz < 2 h) (Fig. 5F; 
Supplemental Fig. S9). In contrast, stable 
transcripts (tVz > 6 h) were not enriched in 
either compartment (P = 0.26, x test). 

We considered whether the unstable 
nature of nuclear-localized transcripts 
was due to an overabundance of unsta- 



sense transcripts were also more stable than head-to-head bi- 
directional transcripts (P < 0.05). Hierarchical clustering of IncRNA 
stability profiles supported this result. The most stable classes (inter- 
genic, c/s-antisense, tail-to-tail and uaRNAs) clustered together, as did 
the unstable group of intronic, promoter-associated and head-to- 
head bidirectional IncRNAs, while lincRNAs showed intermediate 
characteristics (Supplemental Fig. S6). 

Comparison of cellular localization with stability 

Functional studies of IncRNAs have shown that many act as repressors 
or activators of gene expression through various mechanisms, 



ble IncRNA genomic classes (intronic, 
promoter-associated, and head- to-head). Head- to-head transcripts 
were, instead, underrepresented in the nucleus (P = 0.0154, Fisher's 
exact test). An increased prevalence of exosome- targeted transcripts 
similar to CUTs and PROMPTS could also help explain the low sta- 
bility of nuclear transcripts; however, there was no enrichment for 
promoter-associated IncRNAs in the nucleus (P = 0.74, Fisher's exact 
test) or among the unstable nuclear fraction (P = 1.0, Fisher's exact 
test). Although intronic transcripts were enriched in the nuclear 
fraction (P = 0.0005, Fisher's exact test), nuclear-localized intronic 
IncRNAs were less stable than those intronic IncRNAs not found in 
the nucleus (P = 0.015, Mann Whitney t-test). Taken together, these 
results suggest the impact on stability is due to the cellular location 
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of these IncRNAs, rather than resulting 
from their genomic classification. 

The relatively high turnover of nu- 
clear-localized IncRNAs is consistent with 
the notion that these IncRNAs are acting 
as regulatory molecules, which, like their 
protein-coding counterparts, are turned 
over rapidly to provide a high degree of 
dynamism to the processes they regulate. 
We find preliminary evidence to support 
this hypothesis by investigating the sta- 
bility of RNAs identified as interacting 
with chromatin-binding proteins (Amaral 
et al. 2011; Guttman et al. 2011), which 
are less stable than IncRNAs (P = 0.049, 
Mann Whitney t-test), more likely to be 
unstable (P = 0.017, Fisher's exact test), 
and more likely to be nuclear-localized 
(P = 0.028, Fisher's exact test). 
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LncRNA decay elements 

A number of decay elements have been 
detected, predominantly in the 3'UTR, 
of mRNAs, including AU-rich elements 
(AREs) (Bakheet et al. 2001) and Puf 
family protein-binding sites (Xie et al. 
2005). Spliced mRNA transcripts are more 
stable than single exon mRNAs (Narsai 
et al. 2007), and there are positive corre- 
lations between both the number of in- 
trons and the density of splice junctions 

per kb of ORF length and stability (Sharova et al. 2009). To in- 
vestigate whether these factors known to affect coding transcript 
stability also impact the stability of noncoding RNAs, which would 
suggest common regulatory and decay pathways, we analyzed the 
splicing status and putative AREs and Puf -binding sites of IncRNAs 
in the context of their stability. Similar to mRNAs, we find that 
spliced IncRNAs are more stable than single exon transcripts 
(P < 0.0001) (Fig. 6A) and that single exon transcripts are over- 
represented among unstable transcripts (VA < 2 h) (P < 0.0001) 
(Fig. 6B; Supplemental Fig. Sll). However, unlike mRNAs (Sup- 
plemental Fig. S10), no correlation was found between the stability 
of spliced transcripts and the number of introns or density of splice 
junctions (Supplemental Fig. Sll). Furthermore, no significant 
relationship was found between IncRNA stability and predicted 
ARE or Puf-binding sites (Supplemental Fig. Sll), although the 
closeness to statistical significance for ARE sites (P = 0.091, Spear- 
man correlation), suggests such a relationship might be present 
with a larger sample size. As expected, we did observe a negative 
relationship between the presence of these motifs and mRNA sta- 
bility (Supplemental Fig. S10). 

Utilizing our genome-wide IncRNA annotations, we exam- 
ined other properties of IncRNAs that could impact stability. 
LncRNAs with a polyadenylation signal had higher stability than 
those transcripts that appeared internally primed (P < 0.0001) (Fig. 
6C; Supplemental Fig. Sll). We also identified positive correlations 
between the stability of IncRNAs and increases in transcript con- 
servation and GC% percentage (Fig. 6D; Supplemental Fig. Sll). 
These relationships were also identified in mRNAs (Supplemental 
Fig. S10). Although current long RNA structure prediction ap- 
proaches are limited, GC% is a simple proxy for the potential of 



GC% 



Figure 6. LncRNAs and decay elements. (A) Comparison of the stability of single exon versus spliced 
IncRNAs. Box-and-whisker plot. (Whiskers) 1st-99th percentile, with individual transcripts outside this 
shown as dots. (Box) 25th-75th percentile. Difference calculated using a nonparametric Mann -Whitney 
t-test. (B) Frequency distribution showing the fraction of single exon versus spliced IncRNAs in 2-h bins. 
Plotted points are at the center of the 2-h bin. Only time points with 1 % or more of transcripts are plotted. 
(C) Comparison of the stability of IncRNAs containing a major or minor polyA signal versus those with 
evidence of internal priming. Box-and-whisker plot and statistical testing as per A (D) Correlation between 
GC% and IncRNA half-life. Spearman correlation = 0.0852 (P = 0.0145) indicates a small positive re- 
lationship between increased half-life and IncRNAs with higher GC%. Spearman correlation utilized because 
data is non-Gaussian. Trend line shows a semilog fit from nonlinear regression. Axes are logl 0-linear. 



a RNA to fold into secondary structures due to the higher stability 
of GC base pairs. The positive relationship between GC% and 
stability suggests that IncRNAs with more structural elements are, 
perhaps not surprisingly, more stable. 

Finally, we also investigated whether there were any charac- 
teristics of IncRNAs that could explain why they were less stable 
than mRNAs. Unexpectedly, we observed similar proportions of 
mRNAs and IncRNAs localized to the nucleus, with nuclear- 
enriched mRNAs also displaying lower stability than cytoplasmic or 
nonenriched transcripts (P < 0.001, one-way ANOVA). These results 
suggest it is not the frequent localization of IncRNAs to the nucleus 
that results in their lower stability relative to mRNAs; instead, the 
differential stability may be due to other RNA features. We find that 
many RNA characteristics that positively correlate with stability 
in IncRNAs and/or mRNAs, such as whether or not a transcript is 
spliced, the density of splice junctions, GC%, and degree of evolu- 
tionary conservation, have much lower values in IncRNAs compared 
to mRNAs (all P < 0.0001, Mann Whitney t-tests and \ tests). For 
example, the average intron density in IncRNAs is much lower than 
for mRNAs, which may explain why the correlation for IncRNAs was 
positive but not significant; i.e., too few IncRNAs have sufficient 
intron density to provide measureable stabilization, as is the case for 
mRNAs. Taken together, we find evidence that a number of sequence 
features known to affect mRNA stability also impact IncRNAs and 
provide some insight into why IncRNAs are, on average, less stable. 



Comparison of IncRNA stability between mouse and human 

We investigated the conservation and variability in IncRNA half- 
lives between mouse and human. Searching the stability micro- 
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arrays performed by Friedel et al. (2009) on human B cells and 
mouse 3T3 cells for probes corresponding to the genomic locations 
of IncRNAs from IncRNAdb (Amaral et al. 2011), we identified 11 
IncRNAs with measured half-lives in both species (Supplemental 
Table S5). While most IncRNAs showed similar half-lives between 
the two species, four were significantly different, showing higher 
stability in human (all P < 0.001) (Fig. 7A). For example, MALAT1, 
which is found in SC35 splicing speckles and plays a role in the 
regulation of splicing (Tripathi et al. 2010) was highly stable in 
human B cells, with a half-life of 16.5 h, compared to 3 h in mouse 
3T3 cells. Investigation of Malatl in mouse N2A cells by qPCR 
identified a half-life of 4 h (Supplemental Fig. S13), in good 
agreement with Friedel et al. (2009) but significantly shorter than 
determined previously in mouse embryonic fibroblasts (Bernard 
et al. 2010). NEAT1 was also stable in human cells but unstable in 
mouse (investigated further below). 

We recently reported on the extreme stability of the Zfasl 
IncRNA, which is expressed from a bidirectional promoter with the 
Znfxl protein-coding gene (Askarian-Amiri et al. 2011) and showed 
no evidence of degradation after 16 h of transcriptional inhibition. 
Using the improved stable control genes, we extended this analysis 
to find that Zfasl levels remain unchanged even after 32 h (Fig. IB). 
This remarkably high level of stability may explain its very high 
expression level in N2A cells (top 0.5% of all expressed transcripts). 
In contrast, the stability of Zfasl in mouse 3T3 cells and in human 
revealed much shorter half-lives of 1.7 and 3 h, respectively (Friedel 
et al. 2009). Friedel et al. (2009) did not block transcription to de- 
termine RNA half -lives; however, this result appears independent of 
the method used, as another study of mRNA half-lives in mouse 
embryonic stem cells using actinomycin D found the Zfasl half -life 
to be 63 min (Sharova et al. 2009). Rather, these results suggest there 
can be large changes in IncRNA stability both within and between 
species and highlights the dynamic nature of IncRNA post-tran- 
scriptional regulation. 

The architectural IncRNA Neatl is highly unstable 

Neatl is a structural RNA involved in the formation and integrity 
of nuclear paraspeckles (Bond and Fox 2009; Sunwoo et al. 2009; 
Mao et al. 2011). Surprisingly, given its identification as a func- 
tional "architectural" RNA and similar to Friedel et al. (2009), Neatl 
was very unstable with a half-life of <30 min by array and 15 min 
by qPCR (Fig. 7B). As Neatl is unspliced and has short, —3.2 kb 
(Neatl _vl /Men epsilon) and long, —20 kb (Neatl_v2/Men beta) 
isoforms transcribed from a single promoter (Guru et al. 1997; 
Sunwoo et al. 2009), both the array and qPCR were quantifying the 
combined stability of the short and long isoforms (Fig. 7B,C). 
Neatl _v2, but not Neatl _vl, is able to rescue a Neatl knockdown 
(Sasaki et al. 2009) and is required for paraspeckle formation in 
mice (Nakagawa et al. 2011), suggesting this isoform is more cen- 
tral to Neatl function. Although qPCR specific to this isoform 
revealed it was significantly more stable than the combined short 
and long isoforms (P < 0.05), the long isoform is still unstable with 
a half-life of 60 min (Fig. 7D,E). 

To investigate whether the high instability of Neatl affected 
the formation of paraspeckles, we performed RNA-protein FISH in 
N2A cells. This revealed only a few, small nuclear speckles within 
a minority of cells by RNA FISH, which was consistent with the lack 
of localization of the paraspeckle protein NONO to nuclear speckle 
structures (Fig. 7G; Supplemental Fig. SI 2). N2A cells are "un- 
differentiated," and the up-regulation of Neatl (and formation of 
paraspeckles) upon differentiation has been reported in other cell- 



type specific differentiation systems (Chen and Carmichael 2009; 
Sunwoo et al. 2009). To examine the possibility that Neatl up- 
regulation may be partially post-transcriptional and that low sta- 
bility may prevent proper paraspeckle formation, we investigated 
Neatl stability in 3T3 cells, which have abundant paraspeckles (Fig. 
7H; Supplemental Fig. S12). The half-lives of Neatl (vl and v2) and 
Neatl_v2 were calculated as 32 min (95% CI = 24 min-47.8 min) 
and 63 min (95% CI = 47.3 min-93 min), respectively, indicating 
that there was no difference in Neatl stability between cells with 
and without paraspeckles. Subtracting the expression of Neatl _v2 
from Neatl (vl and v2) revealed that Neatl_vl was significantly less 
stable than the long isoform, with a half-life of 19.6 min (95% CI = 
15.22 min-27.59 min) (Fig. 7F). Therefore, we conclude that the 
presence/absence of paraspeckles in these cells is not related to 
Neatl stability. Furthermore, the highly unstable nature of Neatl 
demonstrates that instability is not a barrier to IncRNA function 
and that, instead, the high turnover of Neatl could contribute to 
the highly dynamic nature of paraspeckles (Mao et al. 2011). 

Discussion 

Contrary to some expectations, examination of IncRNA stability 
and comparison to that of protein-coding genes has revealed that 
IncRNAs are not generally unstable but rather show a wide varia- 
tion in their stability profiles in a manner similar to mRNAs. In 
light of several studies that show remarkably specific expression 
profiles for IncRNAs in diverse tissues and development systems 
(Dinger et al. 2008; Guttman et al. 2010; Cabili et al. 2011), as well 
as a recent large-scale screen showing functions for many IncRNAs 
in embryonic stem cell differentiation (Guttman et al. 2011), the 
large variation in IncRNA stability is consistent with their func- 
tional diversity and is likely a reflection of their complex post- 
transcription regulation. Indeed, post-transcriptional regulation is 
particularly important for IncRNAs because, unlike protein-coding 
genes, they do not have any further translational and post-trans- 
lational opportunities for regulation. We find that a number of 
IncRNA characteristics correlate with stability, including genomic 
location, subcellular localization, splicing, and GC percentage. 
Other characteristics, such as the density of exon junctions, had no 
measurable impact on IncRNAs despite providing the strongest 
correlation with the mRNA stability of those tested. LncRNA ex- 
pression level also was not correlated with stability, suggesting 
that IncRNAs below the expression cutoff will not be generally 
unstable either, although this possibility cannot be completely 
discounted. Overall, we find that analysis of IncRNA sequence 
features is an effective means of gaining insight into factors un- 
derlying stability. 

A large number of post-transcriptional regulatory pathways 
act by regulating RNA stability and degradation. Some, such as the 
nuclear and cytoplasmic exosomes, have been found to degrade 
IncRNAs (including CUTs, UNTs, and PROMPTS) and regulate their 
function (Camblong et al. 2007; Chekanova et al. 2007; Berretta 
et al. 2008; Preker et al. 2008; van Dijk et al. 2011). How other 
pathways, such as miRNAs and nonsense mediated decay (NMD), 
act on IncRNAs is poorly understood. In Arabidopsis, putative 
IncRNAs can be regulated by NMD (Tycowski et al. 1996; Kurihara 
et al. 2009), a process thought to require a pioneer round of tran- 
slation (Maquat et al. 2010), which could suggest "noisy" trans- 
lation of IncRNAs. Recently, the first example of miRNA regulation 
of IncRNAs was identified (Hansen et al. 2011), suggesting this 
process may be widespread. RNA-binding proteins can also affect 
RNA stability; for example, the presence of the exon-junction 
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Figure 7. Neatl stability. (A) Comparison of IncRNA stabilities in mouse and human from Friedel et al. (2009). Error bars are standard deviations. 
Significant differences determined by two-way ANOVA using Bonferroni multiple comparisons. (***) P < 0.001, (****) p < 0.0001 . (B) Transcript decay 
curve for Neatl (both vl and v2 isoforms) in N2A cells after blocking transcription with actinomycin D and measuring transcript remaining relative to 
Gapdh by qPCR. Four biological replicates. Error bars are standard deviations. Fit modeled by one-phase decay using nonlinear least squares regression. (C) 
Neatl genomic locus showing vl and v2 isoforms plus positions of PCR amplicons, FISH, and microarray probes. (D) Neatl _v2/ long isoform half-life in 
N2A cells, qPCR as per B. (£) Comparison of Neatl (both vl and v2 isoforms) (95% CI = 1 0-32 min) and Neat7_\/2/long isoform (95% CI = 40 min-1 h, 57 
min) in N2A cells. Error bars show 95% confidence intervals. Unpaired t-test. (F) Comparison of stability of Neatl isoforms in 3T3 cells. qPCR from three 
biological replicates. Error bars show 95% confidence intervals. Unpaired t-test. (C,H) Combined RNA protein FISH on N2A cells (C) and 3T3 cells (H). (Left 
panel) Neatl RNA; (second panel) localization of paraspeckle protein NONO; (third panel) DAPI nuclear stain; (final panel) overlay. 
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complex has been hypothesized to be responsible for the increased 
stability of mRNAs with a higher density of splice junctions (Sharova 
et al. 2009). Indeed, we find this appears to apply to IncRNAs, as 
spliced IncRNAs are more stable than those that are unspliced. 

We find that a larger proportion of IncRNAs than mRNAs are 
unstable. Both our results and previous findings from mammals 
and yeast (Seidl et al. 2006; Berretta et al. 2008; van Dijk et al. 201 1) 
have shown that low stability does not mean IncRNAs are non- 
functional. Instead, low stability can be important for regulating 
IncRNA function. For example, stability can determine where 
and when a IncRNA can perform its function, as exemplified by 
PH084 antisense in yeast (Camblong et al. 2007). Furthermore, by 
not requiring translation to produce a functional gene product, 
IncRNAs, especially those acting in the nucleus, can function al- 
most immediately after transcription (Dinger et al. 2009a) and so 
may not require a long half-life. In addition, unstable transcripts 
are very sensitive to changes in the level of transcription and re- 
spond quickly when transcription changes (Elkon et al. 2010; 
Rabani et al. 2011). Low stability, such as that observed in many 
transcription factor mRNAs, allows dynamic gene expression in 
response to environmental signals (both intra- and extracellular) 
that can activate or repress genes. A growing number of IncRNAs, 
have been found to interact with chromatin and chromatin mod- 
ification complexes, and some have been shown to regulate gene 
expression (Dinger et al. 2008; Khalil et al. 2009; Zhao et al. 2010; 
Guttman et al. 2011; Wang et al. 2011). It has previously been 
hypothesized that unstable IncRNA would be suitable for regulat- 
ing gene expression (Dinger et al. 2009a), and indeed, we find that 
RNAs which associate with chromatin binding proteins exhibit 
lower stability, suggesting that IncRNA "transcription factors" with 
analogous properties to their coding mRNA counterparts may 
exist. 

Consistent with our finding that low stability does not mean 
lack of function, we identify the nuclear paraspeckle component 
Neatl as among the least stable IncRNAs. Nuclear paraspeckles are 
highly dynamic nuclear subdomains (Mao et al. 2011). Transcrip- 
tion of Neatl is required to both nucleate and maintain para- 
speckles at the Neatl locus, with paraspeckles quickly disassem- 
bling upon inhibition of Neatl transcription (Mao et al. 201 1). The 
unstable nature of the Neatl isoforms could contribute to this 
dynamic regulation, as paraspeckles formed around a highly stable 
RNA would likely be less susceptible to such quick degradation. In 
human, NEAT1 appears to be more stable than in mouse (Friedel 
et al. 2009; Sasaki et al. 2009; Sunwoo et al. 2009), with unknown 
consequences for NEAT1 function and paraspeckle dynamics. 

Although occurring at a lower incidence than mRNAs, we also 
identified a number of highly stable IncRNAs. Apart from the re- 
cently identified Zfasl (Askarian-Amiri et al. 2011), which was too 
stable for array-based half-life modeling but the stability of which 
has been confirmed by qPCR, few of these IncRNAs have been 
characterized. Highly stable mRNAs often encode highly stable 
proteins with "housekeeping" functions, such as those in central 
metabolism, which do not require dynamic regulation for their 
proper function (Schwanhausser et al. 2011). Similarly, highly 
stable IncRNAs may serve "housekeeping" roles. The existence of 
highly stable IncRNAs also suggests that some IncRNAs have 
evolved to avoid degradation through various mechanisms, such 
as secondary structure and interactions with RNA-binding pro- 
teins, about which little is known. 

Despite the large impact actinomycin D treatment has on 
cells, previous genomic scale studies have shown good agreement 
between actinomycin D treatment and other methods, which do 



not require transcriptional blockade (Dolken et al. 2008; Friedel 
et al. 2009; Rabani et al. 201 1; Schwanhausser et al. 201 1), with the 
proviso that the actinomycin D time course is of sufficient dura- 
tion (>6 h) to determine the half-life of stable transcripts (Friedel 
et al. 2009). Although actinomycin D treatment can stabilize 
transcripts globally, the rank order of transcript stabilities appears 
relatively unchanged (Rabani et al. 2011; Schwanhausser et al. 
2011), supporting the validity of our conclusions regarding IncRNA 
stability. 

The increasing availability of genome-wide technologies such 
as microarrays, high-throughput proteomics and next-generation 
sequencing has led to an explosion of studies featuring massive 
data sets. However, there has been minimal innovation with re- 
spect to the visualization of and accessibility to genome-scale data 
sets, and it is typically a complex process to extract and navigate 
the data sets associated with such studies. To address this issue, 
here we have presented our data set using QAPgrid clustering and 
combined this with an interactive navigation system based on 
Google Maps and an easily searchable database. As a result, users 
can become oriented within the data at various levels, such as by 
browsing the stability profiles or by searching for clusters that 
contain a particular transcript. In addition to providing conve- 
nient access to the data presented here, such an interface can be 
readily adapted to other genome-scale data sets. We also suggest 
that such advances in data dissemination are important to provoke 
further innovation in this important area, which we anticipate will 
become an increasingly essential component in any study pre- 
senting genome-scale data. 

Attempts to dissect the regulatory circuitry of a biological 
system have increasingly led to the application and integration of 
various "omics" approaches. Our data and that of others suggest 
that there are significant correlations between stability profile and 
function (Sharova et al. 2009; Rabani et al. 2011; Schwanhausser 
et al. 2011). Furthermore, transcripts clustered into the same decay 
profiles (despite large differences in actual transcript expression 
levels) may be regulated by the same post-transcriptional regula- 
tory pathways and/or contain similar regulatory sequences. Given 
the substantial complexity of the mammalian transcriptome, 
which continues to grow with increasingly sensitive technology, 
there is an increasing need for techniques to aid in the functional 
annotation of novel transcripts and variant isoforms, which now 
by far exceed the number of transcripts for which a function has 
been assigned. We suggest that stability profiling provides an im- 
portant facet in the characterization of the transcriptome in a 
given biological system and is an important consideration in the 
ranking of candidate transcripts to pursue for more detailed func- 
tional studies. 

Methods 

Cell culture and RNA extraction 

Mouse Neuro-2a (N2A) neuroblastoma cells (ATCC number CCL- 
131) and NIH-3T3 cells were cultured at 37°C, 5% C0 2 , in DMEM 
plus 10% newborn calf serum and penicillin/streptomycin. For 
stability experiments, cells were grown to —50% confluency before 
RNA polymerase activity was blocked with 10 |xg/mL actinomycin 
D (Sigma) in DMSO. Control cells were treated with DMSO alone. 
Transcriptional inhibition of N2A cells was conducted for 32 h, 
with cells harvested at time zero (0 h) and after 30 min, 2, 4, 8, 16, 
and 32 h. Inhibition of 3T3 cells was conducted for 24 h with 
harvesting at time zero (0 h) and after 30 min, 2, 4, 8, and 24 h. 
Cells were collected by trypsinization and total RNA extracted 
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using RNeasy columns (Qiagen) and treated with DNase. RNA 
samples were confirmed DNA-free by performing PCR for a geno- 
mic target on the purified RNA. Integrity of RNA for microarrays 
was confirmed using the Agilent 2100 Bioanalyzer (Agilent Tech- 
nologies). High quality, intact rRNA was found at all time points, 
although after 32 h of transcriptional inhibition, the first indi- 
cations of rRNA degradation were visible despite continued high 
RNA integrity number (RIN) values. Success of transcriptional in- 
hibition was confirmed by testing the stability of Myc mRNA (Fig. 
1A; Supplemental Fig. SI 3). 

Stability microarrays 

Stability microarrays were conducted using the NCode Mouse 
Noncoding RNA Microarray (Life Technologies), which contain 
27,281 probes targeted against protein-coding transcripts and 
7228 probes targeted against noncoding transcripts. Two color 
arrays were performed where transcriptionally inhibited time 
points were compared to the untreated 0-h sample. Four biological 
replicates were performed with two dye swaps. Control DMSO- 
treated samples were not analyzed by microarray. 0.5 |xg of total 
RNA was used for cRNA production using the Quick Amp Labeling 
Kit (Agilent). Equal micrograms were used because total RNA is 
comprised mainly of rRNA, which is highly stable, so the amount 
of rRNA, and hence total RNA, stays essentially the same even 
though mRNAs are decaying. Array hybridization was performed 
as per the Two-color Microarray Protocol (Version 5.7) (Agilent) 
and scanned using a G2565BA Microarray Scanner (Agilent). 

Stability array normalization 

Array data was processed using LIMMA (Smyth and Speed 2003; 
Smyth 2004; Ritchie et al. 2007) via the R Project for Statistical 
Computing (www.r-project.org). Background correction was per- 
formed via the minimum method. Between array normalization of 
0-h control samples was performed with G-quantile or R-quantile 
normalization (depending on the channel of the 0-h sample), 
while T-quantile normalization was performed for each transcrip- 
tionally inhibited time point to better retain transcript degradation 
signals. Scaling normalization was conducted using the previously 
identified highly stable gene Zfasl (Askarian-Amiri et al. 201 1) and 
six other stable genes. These were identified by comparing the top 
100 most stable mouse mRNAs from NIH-3T3 cells (Friedel et al. 
2009) against the present data and selecting those appearing to be 
up-regulated during later time points similar to Zfasl. The average 
fold change of this group of seven genes between each time point 
was calculated, set to zero, and the probe intensity data was scaled 
proportionately. Summary statistics for each probe including ex- 
pression values, fold change, adjusted P-values, and Bayesian (B) 
statistics were calculated using LIMMA. An eBayes (B statistic) prior 
of 25% was used. An appropriate expression cutoff was empirically 
determined by examining the ability of the microarrays to detect 
transcriptional decay as the 0-h expression level decreased and 
selecting a 0-h value which was sufficiently above background to 
allow accurate half-lives to be calculated. Raw and normalized 
microarray data is available at the ArrayExpress Data Warehouse 
(EMBL-EBI; ArrayExpress accession number E-MTAB-961). 

Half-life calculations 

Half-lives were calculated for probes above the expression cut-off 
of 25. Probe responses ranged from fast degradation to unchanged. 
Decay profiles were modeled by one-phase exponential decay 
(constraints: plateau > 0, K > 0) and/or linear decay using non- 
linear least squares regression using R (www.r-project.org). Probes 



showing statistically decreased expression were modeled with both 
one-phase decay and linear decay. Only fits with an R 2 > 0.7 were 
utilized. Where both models produced good fits, the model pro- 
ducing the better R 2 value was chosen. The remaining probes were 
modeled with linear decay only. Half-lives were calculated from 
a filtered set of probes shown by an F-test to have a significantly 
negative slope and with R 2 > 0.7. 

Clustering 

Unsupervised hierarchical clustering was performed using cluster3 
software (de Hoon et al. 2004). Clustering of all IncRNAs above the 
expression cut-off used the command line parameters -g 4 -e 2 -m 
m, which specifies the pairwise complete linkage method for the 
clustering and Pearson correlation for both the microarray and 
probe clustering. Normalized probe intensities are expressed rela- 
tive to 1 at time 0 h. Clustering of IncRNA genomic location classes 
used the command line parameters -eg a -g 0 -e 5 -m m, which 
centers each time point by subtracting the mean, utilizing the 
pairwise complete linkage method for the clustering and Spear- 
man's rank correlation for the IncRNA classes only. The resulting 
clusters were visualized using Java Tree View (Saldanha 2004). 

The unsupervised graph-based clustering and visualization of 
decay profiles was performed using QAPgrid. The clustering ap- 
proach and several illustrative examples on different data sets 
have been published previously (Inostroza-Ponta et al. 2011). The 
clustering algorithm receives as input the time expression values of 
14,987 probe sets. After computing the Pearson correlation values 
for all pairs of probe sets (x,y) — which we denote as r(x,y) — a 
distance matrix is calculated according to the formula d(x,y) = 
l-i(x,y). The clustering algorithm (MSTkNN) does not require 
user-defined parameters such as the number of clusters or number 
of elements per cluster. According to its built-in stopping criteria, it 
has returned 152 clusters. In turn, another matrix of distances is 
computed between each of these 152 clusters (for details of cal- 
culations, see Inostroza-Ponta et al. [2011]). With the input of 
these two matrices, we use a metaheuristic (a memetic algorithm) 
to solve a quadratic assignment problem defined over a grid. We 
obtain a layout in which, at least globally, highly similar expres- 
sion patterns among clusters and super-clusters are likely to be in 
close proximity in the final figure produced by the method. For the 
figures, each of the cluster's profile graphs was produced using 
stacked bar plots using R (www.r-project.org). To allow integration 
between half-lives, clustering, and transcript annotations, each 
cluster was linked to a list of the probes contained in the cluster, 
along with half-life and annotation information. Cluster enrich- 
ment analysis was performed with giprofiler and GATHER (Chang 
and Nevins 2006; Medina et al. 2010; Reimand et al. 2011). 

Annotation of coding and noncoding RNA transcripts 

Protein-coding gene structures were based on UCSC Known Genes 
(Fujita et al. 2011) and Refseq (Pruitt et al. 2009) as of April 2011. 
Noncoding transcripts were derived from the "All mRNA" track in 
the UCSC Genome Browser (Fujita et al. 2011) and were defined as 
noncoding if they had <5% overlap with protein-coding exons and 
an ORF less than 100 amino acids. PhyloCSF (Lin et al. 2011) was 
used to filter putative IncRNAs, and any transcripts with a score 
of 50 (or above) were defined as protein coding. A classification 
as protein coding was overridden if the transcript was found in 
IncRNAdb (Amaral et al. 201 1) or had been identified as binding to 
chromatin regulatory proteins (Guttman et al. 2011). Genomic 
locations of IncRNAs were defined as follows: Head-to-head tran- 
scripts from bidirectional promoters were defined as transcripts 
that originate from the antisense strand within 1 kb upstream of or 
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0.5 kb downstream from the 5 ' end of another RNA transcriptional 
start site. Tail-to-tail transcripts terminate on the antisense strand 
within 1000 nt downstream or 1000 nt overlapping the termina- 
tion site of another transcript. C/s-antisense transcripts were de- 
fined as those IncRNAs where at least 90% of their exonic length 
overlapped with a transcript on the opposite strand. Exclusively 
intronic ncRNAs were defined as those that initiated and termi- 
nated within introns of protein-coding genes in the sense orien- 
tation. The broader definition of intronic ncRNAs included any 
sense noncoding transcript where at least 75% of its exonic region 
covers the introns of a protein-coding gene. Intergenic transcripts 
were defined as ncRNAs not transcribed within 2 kb upstream of 
a protein-coding region on either strand or 30 kb downstream in 
the sense direction or 2 kb downstream in the antisense direction. 
3'UTR-associated transcripts (uaRNAs) were defined as any ncRNA 
that initiates within 30 kb downstream from a stop codon on the 
same strand [to allow for the potential of 3'UTRs that extend be- 
yond their annotations (Moucadel et al. 2007; Mercer et al. 2011)]. 
Promoter-associated transcripts were defined as single exon tran- 
scripts whose transcription was found within 3 kb upstream of and 
1 kb downstream from transcription start sites on either strand. 
Note that some genomic categories are mutually exclusive, but 
others are not. LincRNAs were from Guttman et al. (2009, 2010), 
IncRNAdb (Amaral et al. 2011), and Guttman et al. (2011) (chro- 
matin protein-binding lincRNAs only). We identified the half-lives 
for 144 transcripts from lincRNA regions. However, we only clas- 
sified 90 of these as noncoding. This apparent discrepancy is pri- 
marily a consequence of the conservative nature of our IncRNA 
classification but can also be attributed to updated Refseq anno- 
tations that class some lincRNAs as probable coding genes. There 
was no difference between the stability of those lincRNAs we clas- 
sified as coding or noncoding (P = 0.93). 

Puf family-binding sites were predicted using the UGUAN 
AUA consensus sequence (Xie et al. 2005). AU-rich elements were 
searched for using the AUUUA motif. The number of major and 
minor polyadenylation signals was calculated for the last 35 nt of 
transcripts. Major signals: AATAAA and ATTAAA. Minor signals: 
AATTAA, AAATAA, AGTAAA, AATATA, CATAAA, TAATAA, and AAT 
AAT. Transcripts with 12 or more A nt in the final 20 nt of a tran- 
script or first 20 genomic nt 3' from the transcript were identified 
as candidates for internal priming. 

GO analysis was performed with Fatigo (Medina et al. 2010; 
Reimand et al. 2011). Statistical analyses of IncRNA and mRNA 
features that define stability were performed using GraphPad Prism 
(GraphPad Software). 

N2A fractionation arrays 

N2A nuclear and cytoplasmic fractions were isolated using a vari- 
ation of a procedure described previously (Andersen et al. 2002). 
Cells were trypsinized, washed twice in ice cold PBS, and resus- 
pended in low salt buffer (10 mM HEPES, pH 7.9, 10 mM KC1, 1.5 
mM MgCl 2 , 0.5 mM DTT) on ice to swell (swelling was confirmed 
using a phase contrast microscope). Cells were transferred to an ice 
cold Dounce homogenizer and homogenized until at least 95% 
of cells were burst, but nuclei were still intact. Homogenate was 
centrifuged at 220g for 5 min at 4°C to separate the cytoplasmic 
fractions and nuclear-enriched pellets. The pellets were resus- 
pended in 0.25 M sucrose, 10 mM MgCl 2 buffer and overlayed on 
0.88 M sucrose, 0.5 mM MgCl 2 buffer, then were centrifuged at 
3000# for 15 min at 4°C to obtain pure nuclei. RNA was extracted, 
and three biological replicate microarrays performed and normal- 
ized as described previously (Askarian-Amiri et al. 201 1). An eBayes 
(B-statistic) prior of 15% was used. Data was uploaded and ana- 
lyzed using NRED (Dinger et al. 2009b). Probes with expression 



of log 2 ^ 7 were defined as expressed. Enrichment in either 
compartment was defined as > twofold expression enrichment, 
P-value < 0.05, B-statistic > 0. Raw and normalized microarray data 
is available at the ArrayExpress Data Warehouse (EMBL-EBI; 
ArrayExpress accession number E-MTAB-952). 

qPCR 

Reverse transcription was carried out with the Superscript III cDNA 
synthesis kit (Invitrogen) using random hexamers. Quantitative real- 
time PCR (qPCR) was performed using SYBR Green PCR Master Mix 
and real time cyclers (Applied Biosystems). Relative quantification to 
a stability control (Gapdh oxAtp5e) was performed for stability qPCRs. 
At least three biological replicates were performed for all transcrip- 
tionally inhibited time courses. Decay profiles were graphed and 
half-lives calculated using GraphPad Prism (GraphPad Software) as 
per Askarian-Amiri et al. (2011). The 0-h time point expression level 
was set to 100%, and the percentage of transcript remaining at 
each time point calculated. Half-lives were calculated by nonlinear 
regression with a least squares fit (plateau > 0, k > 0), using one- 
phase exponential decay, or, when no decay was observed, a linear 
line. Primer sequences are provided in Supplemental Table S6. 

Absolute quantification was performed for fractionation sam- 
ples. Equivalent micrograms of nuclear and cytoplasmic RNAs used 
for qPCR were converted to cell equivalents by multiplying the 
cytoplasmic expression level by the cytoplasmic/nuclear RNA ratio 
calculated from the RNA extraction yields. Nuclear: cytoplasmic 
enrichment ratios were calculated from three biological replicates. 

Western blotting 

Protein lysate concentrations were measured using the Bradford 
assay and then run on 12.5% polyacrylamide gels, which were 
Coomassie-stained to confirm equal loadings. Proteins from SDS- 
PAGE gels were transferred to PVDF membrane using a semi-dry 
procedure. Membranes were blocked overnight in 5% milk pow- 
der. Primary and secondary antibody incubations were carried out 
for 2 h. GAPDH (1:4000 dilution) (R&D systems), HISTONE H3 
(1:1000), anti-rabbit HRP (1:5000) (Cell Signaling Technologies). 
Proteins were visualized using ECL (GIBCO). 

FISH 

N2A and NIH-3T3 cells were seeded onto microscope cover slips 
and grown until —80% confluent. Cover slips were incubated in 
Extraction buffer (100 mM NaCl, 300 mM sucrose, 10 mM PIPES, 3 
mM MgCl 2 , 0.5% Triton, and 10 mM vanadyl ribonucleoside 
complex, VRC) for 1 min and then fixed with 4% PFA/PBS at 4°C 
for 40 min, followed by overnight incubation in 70% ethanol at 
4°C. Biotinylated oligos against mouse Neatl (5'-aaactgttattcccatc 
aacccacggttccaggcacatt-3 'biotin and 5 '-aatgacacaccactcaactaaattca 
gggcaggttggct-3 'biotin) or a scrambled control oligo (5'-gacatcct 
catcggaatttagggcatatcgcaatcgcgc-3 'biotin) were used at a final con- 
centration of 1 |jlM and denatured at 80°C for 10 min in 70% 
formamide. Oligos were made up to 20 juuL with hybridization 
buffer (0.4% BSA, 20% dextran sulphate, 4x SSC buffer, 2 U/|jlL 
RNase out) and incubated on cover slips at 37°C overnight. Cover 
slips were washed once in 15% Formamide/2x SSC, then in 2x 
SSC, and then in lx SSC. Streptavidin Alexa Fluor488 (Invitrogen) 
was diluted 1:500 in 4x SSC/0.4% BSA and incubated on each 
cover slip at 37°C for 1 h, followed by washing in 4x SSC, then 4x 
SSC/0.1% Triton, then again in 4x SSC. NONO mouse mono- 
colonal antibody (Souquere et al. 2010) was diluted 1:500 in PBS/ 
0.05% Tween-20 and incubated on cover slips at room temperature 
for 1 h, followed by three washes with PBS/0.05% Tween-20, then 
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probed with anti-mouse-TRITC secondary antibody (Jackson 
Immuno Research Laboratories) diluted 1:250 in PBS/0.05% 
Tween-20 for 1 h, followed by three washes in PBS/0.05% Tween- 
20. Cover slips were stained with DAPI (Sigma), mounted with 
Vectashield (Vector Laboratories), and z-stacks obtained using a 
Nikon TiE fluorescent microscope. 

Paraspeckle quantification 

Maximum projections of deconvolved z-stacks of NIH-3T3 cell 
nuclei (N = 18) and N2A cell nuclei (N= 31) from Neatl FISH with 
NONO and DAPI costaining were converted to grayscale in each of 
the three fluorescent channels used. The images were then ana- 
lyzed using CellProfiler (www.cellprofiler.org) using a modification 
of the "Speckle counting" pipeline (available on request) that is 
given on the CellProfiler website (http://www.cellprofiler.org/ 
examples, shtml). 

Data access 

Raw microarray data is available at the ArrayExpress Data Ware- 
house (EMBL-EBI; ArrayExpress accession numbers E-MTAB-952 
and E-MTAB-961). All normalized stability and localization array 
data is available online at http://stability.matticklab.com and/or in 
Supplemental Tables SI and S2. 
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